Audio Features for Noisy Sound Segmentation
نویسندگان
چکیده
Automatic audio classification usually considers sounds as music, speech, silence or noise, but works about the noise class are rare. Audio features are generally specific to speech or music signals. In this paper, we present a new audio feature sets that lead to the definition of four classes: colored, pseudo-periodic, impulsive and sinusoids within noises. This classification relies on works about the perception of noises. This audio feature set is experimented for noisy sound segmentation. Noise-to-noise transitions are characterized by means of statistical decision model based on Bayesian framework. This statistical method has been trained and experimented both on synthetic and real audio corpus. Using proposed feature set increases the discriminant power of Bayesian decision approach compared to a usual feature set.
منابع مشابه
Sound analysis using MPEG compressed audio
There is a huge amount of audio data available that is compressed using the MPEG audio compression standard. Sound analysis is based on the computation of short time feature vectors that describe the instantaneous spectral content of the sound. An interesting possibility is the calculation of features directly from compressed data. Since the bulk of the feature calculation is performed during t...
متن کاملAudio scene segmentation using multiple features, models and time scales
In this paper we present an algorithm for audio scene segmentation. An audio scene is a semantically consistent sound segment that is characterized by a few dominant sources of sound. A scene change occurs when a majority of the sources present in the data change. Our segmentation framework has three parts: (a) A definition of an audio scene (b) multiple feature models that characterize the dom...
متن کاملSpeech/Nonspeech Segmentation in Web Videos
Speech transcription of web videos requires first detecting segments with transcribable speech. We refer to this as segmentation. Commonly used segmentation techniques are inadequate for domains such as YouTube, where videos may have a large variety of background and recording conditions. In this work, we investigate alternative audio features and a discriminative classifier, which together yie...
متن کاملJoint Object-Material Category Segmentation from Audio-Visual Cues
Figure 1: (a) The noisy predictions made by the per-pixel unary classifiers. (b) The output of the CRF using only visual features. (c) The use of auditory information improves material labeling. (d) Finally, joint optimisation between object and meterial categories improves object labelling as well. (e) The ground truth. (f) The input image, showing the locations where sound information is pres...
متن کاملAutomatic segmentation of speakers in broadcast audio material
In this paper, dimension-reduced, decorrelated spectral features for general sound recognition are applied to segment conversational speech of both broadcast news audio and panel discussion television programs. Without a priori information about number of speakers, the audio stream is segmented by a hybrid metric-based and model-based segmentation algorithm. For the measure of the performance w...
متن کامل